Input Decay: Simple and Effective Soft Variable Selection

نویسندگان

  • Nicolas Chapados
  • Yoshua Bengio
چکیده

To deal with the overfitting problems that occur when there are not enough examples compared to the number of input variables in supervised learning, traditional approaches are weight decay and greedy variable selection. An alternative that has recently started to attract attention is to keep all the variables but to put more emphasis on the “most useful” ones. We introduce a new regularization method called input decay that exerts more relative penalty on the parameters associated with the inputs that contribute less to the learned function. This method, like weight decay and variable selection, still requires to perform a kind of model selection. Successful comparative experiments with this new method were performed both on a simulated regression task and a real-world financial prediction task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Simulation and Experimental Verification of Closed Loop Operation of Buck / Boost DC-DC Converter with Soft Switching

A major problem in an isolated DC/DC converters operating at high switching frequencies is the attendant switching losses in the semiconductor devices. This can be reduced by introducing either zero-voltage switching (ZVS) or zero-current switching (ZCS) of the semiconductor switches. This paper deals with the simulation, design, fabrication and experimental evaluation of a novel soft-switching...

متن کامل

Selection rules in three-body B decay from factorization

Extending the dynamics underlying the factorization calculation of two-body decays, we propose simple selection rules for nonresonant three-body B decays. We predict, for instance, that in the Dalitz plot of B → D0π+π−, practically no events should be found in the corner region of E(π) ≤ ΛQCD as compared with the corner of E(π−) ≤ ΛQCD. We also predict that there should be very few three-body d...

متن کامل

Input variable scaling for statistical modeling

Input variable scaling is one of the most important steps in statistical modeling. However, it has not been actively investigated, and autoscaling is mostly used. This paper proposes two input variable scaling methods for improving the accuracy of soft sensors. One method statistically derives the input variable scaling factors; the other one uses spectroscopic data of a material whose content ...

متن کامل

Efficiency of Indirect Selection to Improve Yield and Drought Tolerance in Populations Derived from Inter-Specific Hybridization of Soft Flower

The use of appropriate selection methods are important to increase yield in breeding programs. Utilization of selection indices is one of the most effective method for selection of superior genotypes to improve the complex traits such as seed yield. In order to evaluate the efficiency of different selection methods, 63 families of two F3 populations derived from inter-specific hybridization of ...

متن کامل

Application of genetic algorithm (GA) to select input variables in support vector machine (SVM) for analyzing the occurrence of roach, Rutilus rutilus, in streams

Support vector machine (SVM) was used to analyze the occurrence of roach in Flemish stream basins (Belgium). Several habitat and physico?chemical variables were used as inputs for the model development. The biotic variable merely consisted of abundance data which was used for predicting presence/absence of roach. Genetic algorithm (GA) was combined with SVM in order to select the most important...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002